Generic control layer in a cloud environment

ABSTRACT

A computing system implemented method, in one embodiment, can include a cloud control module receiving a constraint for cloud architecture. In addition, the method can include the cloud control module receiving a plurality of cloud service provider capabilities. Furthermore, the method can include the cloud control module filtering the plurality of cloud service provider capabilities to identify a cloud service provider capable of satisfying the constraint. Moreover, the method can include the cloud control module outputting an instruction for a resource from the cloud service provider.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation (and claims the benefit of priorityunder 35 USC 120) of U.S. application Ser. No. 13/071,442 filed Mar. 24,2011, now allowed, which claims the benefit to U.S. ProvisionalApplication No. 61/321,761, filed Apr. 7, 2010, entitled “GenericControl Layer in a Cloud Environment.” Both of these prior applicationsare hereby incorporated by reference in their entirety.

BACKGROUND

Cloud computing is appealing to most enterprises because of its promiseof commodity and on-demand utility computing. However, cloud computingfaces key barriers to adoption. For example, key policy issue barrierscan include legal, indemnity, and compliance. In addition, keytechnology issue barriers can involve security, reliability, andperformance. Furthermore, what makes navigating these issues difficultis that cloud-based application architecture is confusing. First, theterm “cloud” is overloaded and used by many vendors each with vastlydiffering offerings. Second, the on-demand and scalability features ofcloud computing are new and architects are still working to understandhow to leverage these capabilities.

SUMMARY

A computing system implemented method, in one embodiment, can include acloud control module receiving a constraint for cloud architecture. Inaddition, the method can include the cloud control module receiving aplurality of cloud service provider capabilities. Furthermore, themethod can include the cloud control module filtering the plurality ofcloud service provider capabilities to identify a cloud service providercapable of satisfying the constraint. Moreover, the method can includethe cloud control module outputting an instruction for a resource fromthe cloud service provider. Note that in various embodiments, theinstruction for a resource can be implemented as, but is not limited to,a number of appliances to provision from the cloud service provider, aselection of a vendor provided platform from the cloud service provider,and/or a selection of a vendor provided software application from thecloud service provider.

While particular embodiments in accordance with the disclosure have beenspecifically described within this Summary, it is noted that thedisclosure and the claimed subject matter are not limited in any way bythese embodiments.

BRIEF DESCRIPTION OF THE DRAWINGS

Within the accompanying drawings, various embodiments in accordance withthe disclosure are illustrated by way of example and not by way oflimitation. It is noted that like reference numerals denote similarelements throughout the drawings.

FIG. 1 is a cloud management system in accordance with variousembodiments of the disclosure.

FIG. 2 illustrates example logic or functionality of a cloud managementsystem in accordance with various embodiments of the disclosure.

FIG. 3 is an example deployment in accordance with various embodimentsof the disclosure.

FIG. 4 illustrates an example of creating custom services on top ofcommodity cloud and data center offerings in accordance with variousembodiments of the disclosure.

FIG. 5 illustrates an example of how to fulfill client requirements at acloud control layer using less reliable data center and external cloudparts in accordance with various embodiments of the disclosure.

FIG. 6 is a user interface for configuring a cloud control layer inaccordance with various embodiments of the disclosure.

FIG. 7 illustrates both the “Specify Traffic” tab and the“Traffic-Application” sub tab selected within the user interface inaccordance with various embodiments of the disclosure.

FIG. 8 illustrates the “Specify Deployment” tab selected within the userinterface in accordance with various embodiments of the disclosure.

FIG. 9 illustrates the “View Traffic SLA Status” tab selected within theuser interface in accordance with various embodiments of the disclosure.

FIG. 10 illustrates the “View Deployment SLA Status” tab selected withinthe user interface in accordance with various embodiments of thedisclosure.

FIG. 11 illustrates the “Update Settings” tab selected within the userinterface in accordance with various embodiments of the disclosure.

FIG. 12 illustrates both the “Update Settings” tab and the “Application”sub-tab selected within the user interface in accordance with variousembodiments of the disclosure.

FIG. 13 is a block diagram of an exemplary computing system that may beused in accordance with various embodiments of the disclosure.

DETAILED DESCRIPTION

Reference will now be made in detail to various embodiments inaccordance with the disclosure, examples of which are illustrated in theaccompanying drawings. While the disclosure will be described inconjunction with various embodiments, it will be understood that thesevarious embodiments are not intended to limit the disclosure. On thecontrary, the disclosure is intended to cover alternatives,modifications and equivalents, which may be included within the scope ofthe disclosure. Furthermore, in the following detailed description ofvarious embodiments in accordance with the disclosure, numerous specificdetails are set forth in order to provide a thorough understanding ofthe disclosure. However, it will be evident to one of ordinary skill inthe art that the disclosure may be practiced without these specificdetails. In other instances, well known methods, procedures, components,and circuits have not been described in detail as not to unnecessarilyobscure aspects of the disclosure.

Some portions of the detailed descriptions that follow are presented interms of procedures, logic blocks, processing, and other symbolicrepresentations of operations on data bits within a computer memory.These descriptions and representations are the means used by thoseskilled in the data processing arts to most effectively convey thesubstance of their work to others skilled in the art. In the presentapplication, a procedure, logic block, process, or the like, isconceived to be a self-consistent sequence of operations or steps orinstructions leading to a desired result. The operations or steps arethose utilizing physical manipulations of physical quantities. Usually,although not necessarily, these quantities can take the form ofelectrical or magnetic signals capable of being stored, transferred,combined, compared, and otherwise manipulated in a computer system orcomputing device. It has proven convenient at times, principally forreasons of common usage, to refer to these signals as transactions,bits, values, elements, symbols, characters, samples, pixels, or thelike.

It is noted that in one embodiment in accordance with the disclosure, ageneral description of “cloud” is a collection of network-hostedservices accessible from almost anywhere with the following attributes:elastically scalable, illusion of infinite capacity, availableon-demand, and consumption-based charges with no upfront commitment, butis not limited to such. This description can apply to all levels of thecloud (e.g., process, software application, compute platform, or virtualmachine hosting).

FIG. 1 is a cloud management system 100 in accordance with variousembodiments of the disclosure. The cloud management system 100 caninclude, but is not limited to, a cloud transformation layer 102, acloud control layer 104, and a cloud instantiation layer 106. Within thepresent embodiment, each layer can be a set of information technology(IT) functions that deliver conceptually similar services within part ofthe overall architecture. In one embodiment, the functions of each layerinterface directly with those of adjacent layers. Within the cloudmanagement system 100, details of other layers are decoupled andabstracted to separate concerns; each layer focuses on ensuring its setof services. Note that in one embodiment, the cloud transformation layer102, the cloud control layer 104, and the cloud instantiation layer 106can also be referred to as the cloud transformation module 102, thecloud control module 104, and the cloud instantiation module 106,respectively, but are not limited to such.

In various embodiments, one or more of the operations of the cloudmanagement system 100 can be performed by software, by firmware, byhardware, or by any combination thereof, but is not limited to such. Thecloud management system 100 can include processes of embodiments of thedisclosure which can be carried out by a processor(s) and electricalcomponents under the control of computer or computing device readableand executable instructions (or code). The computer or computing devicereadable and executable instructions (or code) may reside, for example,in data storage features such as computer or computing device usablevolatile memory, computer or computing device usable non-volatilememory, and/or computer or computing device usable mass data storage.However, the computer or computing device readable and executableinstructions (or code) may reside in any type of computer or computingdevice readable medium.

Within FIG. 1, in one embodiment the cloud transformation layer 102transforms user-generated application program and/or data representationinto the cloud consumable representation. Example transformations caninclude, but are not limited to, data encryption, encoding for errorcorrection, formatting, code recompiling, code augmentation with thecloud provider specific Application Programming Interface (API),segmenting large data objects into smaller units, reformatting datastructures into XML (Extensible Markup Language), or applying businesslogic to structure data. Likewise example transformations applied tocode can include, but are not limited to, recompiling, translating intoa different language, or augmenting with additional code and/orcloud-provider specific API (e.g., replacing a general “get” or “put”statement requesting storage and retrieval with the API specific to thecloud service provider's storage service). The processing by thetransformation layer 102 can complete the program for execution on thespecific compute environment of the cloud.

Within FIG. 1, the cloud control layer 104 in an embodiment canprescribe the best possible or appropriate composition of an existingvendor provided platform and/or set of virtual machine appliances neededto form the desired compute platform. The control layer 104 candetermine the quantity, type, and location of virtual appliances tocreate the desired compute platform. More than one-time provisioning,the control layer 104 can determine how and when to scale the computeplatform to meet guarantees around service level agreements (SLAs),compliance, cost, and capability. Note that the control layer 104 canassign the formatted content from the transformation layer 102 acrossthe prescribed deployment. The control layer 104 can ultimatelydetermine how the cloud compute platform is fulfilled.

The cloud Instantiation Layer 106 in one embodiment can execute thespecification output by the cloud control layer 104 (e.g., automation ofscripts provision, maintain, and tear-down the prescribed deployment ofvirtual appliances). It is noted that as part of provisioning, thescripts can add the assigned content, software, data, configurations,and connectivity to form the application platform. For example in anembodiment, the cloud Instantiation Layer 106 can apply the InternetProtocol (IP) address and update the Domain Name System (DNS). Workingwith the Control Layer 104, the Instantiation Layer 106 can passmonitored statistics and then implement the updates when sourcingarrangements are recomputed. The Instantiation Layer 106 can also governhow provisioning and configuration occurs specific to the cloud-basedservice provider (e.g., data center, EC2, or GoGrid).

Within FIG. 1, the Control Layer 104 in an embodiment can determine thesource, scale, and types of cloud and data center services needed inorder to meet platform guarantees. As such, the Control Layer 104 can bedynamic and responsive to either changes in demand or conditions (e.g.,it is able to respond to a longer than expected processing times orshortened turn around window).

Within an embodiment, the scaling algorithms or procedures of the cloudControl Layer 104 can be generic and independent of application orplatform. In networking, the Transmission Control Protocol (TCP) canprovide reliable end-to-end delivery of a packet stream regardless ofthe content being delivered or the network implementation. In a similarmanner, the cloud Control Layer 104 algorithms or procedures can sourceand possibly scale the compute platform without knowledge of the content(e.g., the application program and data), or the details of theinstantiation or implementation of the virtual machine appliances orvendor packaged platform. The sourcing algorithm or procedure works toprioritize preferred providers and filter out non-compliant providersthat do not fulfill application requirements; then the scaling algorithmor procedure works to meet availability and performance objectives. Theinstantiation of the outputs prescribed by the Control Layer 104 areleft to other orchestration mechanisms (e.g., provision and configurethe virtual machine appliances).

FIG. 2 illustrates example logic or functionality of a cloud managementsystem 200 in accordance with various embodiments of the disclosure.Specifically, FIG. 2 includes a flow diagram of example operations for acloud Transformation Layer 202, a cloud Control Layer 204, and a cloudInstantiation Layer 206 in accordance with various embodiments of thedisclosure. It is noted that in one embodiment, the example operationsor functions of the cloud Transformation Layer 202, the cloud ControlLayer 204, and the cloud Instantiation Layer 206 can be implemented aspart of the cloud Transformation Layer 102, the cloud Control Layer 104,and the cloud Instantiation Layer 106, respectively, of FIG. 1, but arenot limited to such.

In an embodiment, logic or functionality of the Cloud Control Layer 204of the cloud management system 200 can consider various desiredattributes like, but not limited to, performance, availability, desiredcost, and compliance; and chooses the best set and number of resourcesbased on the offered capability of providers or vendors with definedterms like, but not limited to, capability, average capacity, maximumcapacity, start-up time, availability, published cost, and/or location.It is pointed out that a provider's values for these terms may be basedon published metrics or values, or estimated from monitored and measuredmetrics collected over time, or collected measurements of the provider,or any combination thereof. Note that a provider's vector of values forperforming a given capability is enough to summarize the applicationconstraints for the Control Layer 204. Knowledge of the specificimplementation within the virtual machine appliance or vendor packagedplatform is not needed. In addition, note that when the capability issummarized, the algorithms (or functions or procedures) of the cloudControl Layer 204 do not deal directly with virtual machine applianceimplementation.

Within FIG. 2, given the above application attributes and providercapability, the cloud Control Layer 204 can filter the providers orvendors by their ability to satisfy constraint and capability. Fromthese preferred vendors, the logic of the cloud Control Layer 204 candetermine scale and size to source from each vendor. This determinationby the cloud Control Layer 204 can include the assignment of roles,e.g., primary and redundant, or worker and aggregator. The output of thecloud Control Layer 204 can be an instruction or instructions for one ormore resources from the preferred vendors to the cloud InstantiationLayer 206. Note that in various embodiments, the instruction(s) for aresource(s) can be implemented as, but not limited to, a number and typeof virtual machine appliances from each vendor for the cloudInstantiation Layer 206 to provision, a selection of a vendor providedplatform from the cloud service provider, and/or a selection of a vendorprovided software application from the cloud service provider.Therefore, the output of the cloud Control Layer 204 may prescribe avendor package platform that serves the functionality, the details ofbuilding the platform, a vendor pre-bundled software application, and/ora number of virtual machine appliances that augment or replicate thisfunctionality, but is not limited to such.

When determined to build a platform, the logic of the cloud ControlLayer 204 can stipulate the scaling to meet performance andavailability. For example in an embodiment, the cloud Control Layer 204chooses a number of primary active nodes needed to meet performanceconstraints. Then, a number of additional redundant nodes needed toguarantee availability is determined by the cloud Control Layer 204. TheCloud Control Layer 204 can take these numbers along with a costcalculation that multiplies the number of resources and their costthereby resulting in a system of equations or procedures for determiningthe required resources.

Or instead of or in addition to building a platform, the logic of thecloud Control Layer 204 can stipulate a vendor packaged platform to meetperformance and availability. For example in an embodiment, the cloudControl Layer 204 compares the price of a vendor packaged cloud-basedsoftware application service that satisfies SLA constraints with that ofa self-sourced version. Then the cloud Control Layer 204 can determinethe vendor packaged platform, self-sourced platform, or any combinationthat minimizes costs needed to guarantee performance and availability.

Within FIG. 2, in one embodiment, controller logic of the Cloud ControlLayer 204 can determine the number and location of the virtualappliances needed to service the compute prepared by the cloudTransformation Layer 202. Logic of the Cloud Control Layer 204 canfilter the provided appliances and packaged platforms based oncapability to serve and on constraints (e.g., remaining within thecountry or the data center). Then the logic of the Cloud Control Layer204 selects the appropriate number and type of appliances or packagedplatforms based on preferences and to meet desired guarantees. With thehelp of constant monitoring from the cloud Instantiation Layer 206,algorithms of the Cloud Control Layer 204 can assign the files to beprocessed, scale the number of virtual appliances, and re-assign filesin case of delay or failure to meet time constraints. For example in anembodiment, if the processing is taking longer than expected, the logicof the Cloud Control Layer 204 signals that an additional virtualappliance is required. By implementing the logic at the Cloud ControlLayer 204, there is control over the placement of the compute as well asthe time constraint and availability.

Based on the determination of the Cloud Control Layer 204, the cloudInstantiation Layer 206 automates the appropriate scripts to provisionthe appliances or packaged platform, connect the resulting appliances orplatforms, and tear-down appliances or platforms when finished. It isnoted that in one embodiment, the cloud Instantiation Layer 206 can usea script that installs a pre-defined image of the appliances, but is notlimited to such. Implementation of logic at the Cloud Control Layer 204can determine the best possible or appropriate sourcing in terms of thequantity and placement of the appliances or packaged platforms andallows realization of custom attributes.

Within FIG. 2, the cloud Instantiation Layer 206 can provision andmanage the deployment specified by the Cloud Control Layer 204.Functions at the cloud Instantiation Layer 206 can determine whatconfiguration settings are applied. In an embodiment, the cloudInstantiation Layer 206 can use application programming interfaces(APIs) and scripts for each data center or cloud provider (or vendor) toset-up and configure the requested virtual machines or packagedplatforms. The cloud Instantiation Layer 206 can also consolidate themonitoring across service providers (or vendors) that is passed back tothe Cloud Control Layer 204. In one embodiment, for these actions,service automation products may be leveraged like Rightscale orAppistry, but is not limited to such. Note that changes in these actions(e.g., update Amazon credentials) can result in a change in the scriptused at the cloud Instantiation Layer 206 without impacting the CloudControl Layer 204.

Within the cloud management system 200, in one embodiment, securitypresents different concerns at each of the Cloud Transformation Layer202, the Cloud Control Layer 204, and the Cloud Instantiation Layer 206.For example, the Cloud Instantiation Layer 206 can apply network portconfigurations to minimize intrusion, can constantly monitor againstcompromise or network flooding, and can signal the Cloud Control Layer204 for updates when adverse conditions are detected. In addition, theCloud Transformation Layer 202 can provide encryption. As such, eachlayer adds a different level of protection.

Within FIG. 2, it is pointed out that in an embodiment, scripts of theCloud Instantiation Layer 206 can check the health of provisionedmachines and automatically re-provision in the case of failures. Or, inthe case of vendor packaged platforms, the Instantiation Layer 206checks on the health and automatically switches to a pre-determinedback-up platform when needed. Logic of the Cloud Control Layer 204 canact on real-time monitoring to update the prescribed deployment withadequate backup capability. Additionally, the Cloud Transformation Layer202 can apply error correction codes.

At operation 210 of the cloud management system 200, the cloud controllayer 204 can receive business service level agreements (SLAs). It isnoted that operation 210 can be implemented in a wide variety of ways.For example in one embodiment, the business service level agreements canbe received by the cloud control layer 204 at operation 210 from a userinterface of a computing device or system. In an embodiment, thebusiness service level agreements can be read in at operation 210 from afile from the configuration management database. Note that operation 210can be implemented in any manner similar to that described herein, butis not limited to such.

At operation 230 of FIG. 2, the cloud Instantiation Layer 206 cancontinuously monitor one or more service providers (or vendors) based onthe received business service level agreements and then pass or sendthose monitored statistics to the cloud control layer 204. Note thatoperation 230 can be implemented in a wide variety of ways. For example,operation 230 can be implemented in any manner similar to that describedherein, but is not limited to such.

At operation 212, the cloud control layer 204 can determine whether eachof the monitored statistics of the service level agreements issatisfying their corresponding defined threshold. If each threshold issatisfied, no action is taken at operation 212 by the cloud controllayer 204. However, if it is determined at operation 212 that one ormore thresholds are not satisfied, the cloud control layer 204 canproceed to operation 214. It is pointed out that operation 212 can beimplemented in a wide variety of ways. For example, operation 212 can beimplemented in any manner similar to that described herein, but is notlimited to such.

At operation 214 of FIG. 2, the cloud control layer 204 can determinethe bottleneck (or obstacle) causing the one or more service levelagreements to not satisfy their corresponding defined threshold. It isnoted that operation 214 can be implemented in a wide variety of ways.For example in an embodiment, the determination at operation 214 caninclude analyzing the received monitored statistics to identify one ormore virtual machine appliances and/or worker nodes that are causing thebottleneck, but is not limited to such. Operation 214 can be implementedin any manner similar to that described herein, but is not limited tosuch.

At operation 216, the cloud control layer 204 can determine whetherscaling the compute platform capacity will potentially eliminate the oneor more bottlenecks determined at operation 214. If so, the cloudcontrol layer 204 can proceed to operation 218. However, if it isdetermined at operation 216 that scaling the compute platform capacitywill not eliminate the one or more bottlenecks, the cloud control layer204 can proceed to operation 228. Note that operation 216 can beimplemented in a wide variety of ways. For example, operation 216 can beimplemented in any manner similar to that described herein, but is notlimited to such.

At operation 208 of FIG. 2, the cloud transformation layer 202 canreceive and transform code and data, which are then transmitted by thecloud transformation layer 202 to the cloud control layer 204. It ispointed out that operation 208 can be implemented in a wide variety ofways. For example, operation 208 can be implemented in any mannersimilar to that described herein, but is not limited to such.

At operation 228 of FIG. 2, the cloud control layer 204 utilizes thetransformed code and data together with sizing and sourcing informationin order to assign workload across the prescribed deployment. Inaddition, the cloud control layer 204 can transmit at operation 228 theworkload assignment information to the cloud instantiation layer 206. Itis noted that operation 228 can be implemented in a wide variety ofways. For example, operation 228 can be implemented in any mannersimilar to that described herein, but is not limited to such.

At operation 232, after receiving the workload assignment informationfrom the cloud control layer 204, the cloud instantiation layer 206 candistribute the workload across the prescribed deployment based on theworkload assignment. Furthermore, as part of the workload distributionat operation 232, the cloud instantiation layer 206 can provision andconfigure script 238 in order to distribute the workload across theprescribed deployment based on the workload assignment. It is pointedout that operation 232 can be implemented in a wide variety of ways. Forexample, operation 232 can be implemented in any manner similar to thatdescribed herein, but is not limited to such.

At operation 222 of FIG. 2, the cloud control layer 204 can receive oneor more cost constraints for service providers or vendors. It is notedthat operation 222 can be implemented in a wide variety of ways. Forexample in one embodiment, the one or more cost constraints for serviceproviders can be received by the cloud control layer 204 at operation222 from a user interface of a computing device or system. Note thatoperation 222 can be implemented in any manner similar to that describedherein, but is not limited to such.

At operation 224, the cloud control layer 204 can receive one or morecompliance requirements for service providers or vendors. Note thatoperation 224 can be implemented in a wide variety of ways. For examplein an embodiment, the one or more compliance requirements for serviceproviders can be received by the cloud control layer 204 at operation224 from a user interface of a computing device or system. It is pointedout that operation 224 can be implemented in any manner similar to thatdescribed herein, but is not limited to such.

At operation 226 of FIG. 2, the cloud control layer 204 can receive oneor more capability requirements for service providers or vendors. Notethat operation 224 can be implemented in a wide variety of ways. Forexample in one embodiment, the one or more capability requirements forservice providers can be received by the cloud control layer 204 atoperation 226 from a user interface of a computing device or system. Itis noted that operation 226 can be implemented in any manner similar tothat described herein, but is not limited to such.

At operation 236, the cloud instantiation layer 206 can acquire andtransmit capability information for one or more service providers to thecloud control layer 204. It is pointed out that operation 236 can beimplemented in a wide variety of ways. For example, operation 236 can beimplemented in any manner similar to that described herein, but is notlimited to such.

At operation 220 of FIG. 2, the cloud control layer 204 can utilize thereceived compliance requirements for service providers, the receivedcapability requirements for service providers, and the receivedcapability information for one or more service providers in order tofilter out allowable service providers that are able to satisfy therequirements. R is noted that operation 220 can be implemented in a widevariety of ways. For example, operation 220 can be implemented in anymanner similar to that described herein, but is not limited to such.

At operation 218, in order to scale the compute platform capacity topotentially eliminate the one or more bottlenecks determined atoperation 214, the cloud control layer 204 can utilize the received oneor more cost constraints and the allowable service providers todetermine sizing and sourcing from each service provider or vendor.Additionally, the cloud control layer 204 can transmit or output atoperation 218 sizing and sourcing information to the cloud instantiationlayer 206. It is pointed out that operation 218 can be implemented in awide variety of ways. For example in one embodiment, the sizing andsourcing determination at operation 218 by the cloud control layer 204can include the assignment of roles (e.g., primary and redundant, orworker and aggregator), but is not limited to such. In addition, theoutput of the cloud control layer 204 at operation 218 can be a numberand type of virtual machine appliances from each vendor for the cloudinstantiation layer 206 to provision, but is not limited to such.Furthermore, the output of the cloud control layer 204 at operation 218may prescribe a vendor provided platform that serves the basicfunctionality, or the details of building the platform, or a number ofvirtual machine appliances that augment or replicate this basicfunctionality. Operation 218 can be implemented in any manner similar tothat described herein, but is not limited to such.

At operation 234 of FIG. 2, after receiving the sizing and sourcinginformation from the cloud control layer 204, the cloud instantiationlayer 206 can provision and configure script 240 in order to implementthe sizing and sourcing from each service provider or vendor. Moreover,the cloud instantiation layer 206 at operation 234 can also output ortransmit the received sizing and sourcing information to the cloudcontrol layer 204 for use at operation 228. Note that operation 234 canbe implemented in a wide variety of ways. For example, operation 234 canbe implemented in any manner similar to that described herein, but isnot limited to such.

Below is described the formulation by the Cloud Control Layer 204 forscaling virtual appliances to meet performance and availability. Forexample in an embodiment, first a number is chosen of primary activenodes needed to meet performance constraints. Then a determination ismade of the number of additional redundant nodes needed to guaranteeavailability. Taking these numbers along with a cost calculation thatmultiplies the number of resources and their cost; the result is asystem of equations for the Cloud Control Layer 204 to determine therequired resources.

In one embodiment, a determination is first made by the Cloud ControlLayer 204 of the number of primary nodes needed to meet performanceconstraints. At the highest level, the Cloud Control Layer 204 can in anembodiment consider a virtual machine appliance or when horizontallyscalable a group of virtual machine appliances each fulfilling differentroles (e.g., load-balancer and worker node). Note that the functionalityof each role is not interchangeable so the composite performance isbased on the performance of each role. For example, the bottleneck canbe the minimum capacity between the roles. Note that if there is avendor packaged platform already in existence, it can be considered onits own by the Cloud Control Layer 204. For example, the Cloud ControlLayer 204 can consider the maximum capacity it can receive from thatvendor packaged platform. It is noted that the vendor capacity can thenbe augmented by other vendors or from a sourcing of virtual appliances.

In an embodiment, the focus can be on the determination of each roleseparately. In the case where the appliance does not horizontally scale,then the number of nodes can be fixed by the Cloud Control Layer 204.For example, if the load balancer appliance does not scale, there can beone active node.

In one embodiment, one case is horizontally scalable where appliancesperform the same functionality in parallel. Generally this case isadditive in capacity by increasing the number of primary nodes N, thecapacity increases linearly with N. The performance constraint variesbased on the type of workload: events, sessions, or batch items (whichare considered below). The number of active primary worker nodes N canbe scaled by the Cloud Control Layer 204 to meet performanceconstraints. For example, if a maximum or average capacity is associatedper node, a scheme (or technique) scales N so that the sum of themaximum or average capacity of the N nodes is bounded by some number.Another scheme (or technique) by the Cloud Control Layer 204 isdescribed below in which the probability that the capacity of N nodesnot being able to fulfill demand.

Within FIG. 2, in an embodiment event-based appliances replicate code sothat all worker nodes offer the same service. The events to be processedcan arrive within the same queue and then are load-balanced across Nworker nodes. The number of workers N can be determined by the CloudControl Layer 204 to provide the desired performance.

In one embodiment the cloud control layer 204 can let T be the time toprocess one event. Guarantees around T determine the performance ofevent-based systems. For example, find the smallest N so that theprobability of exceeding delay threshold T_(threshold) is bounded by P,in notation that is:Pr(T≧T _(threshold))≦P.

This system can be modeled by the cloud control layer 204 as an M/M/Nqueue with arrival rate λ, where arrivals experience exponential servicetime μ, and are served by N similar and independent servers. The cloudInstantiation Layer 206 can provide estimates of these parameters bymonitoring the average inter-arrival time of events (1/λ) and theaverage time for a unit to be processed (1/μ).

In addition within FIG. 2, the cloud control layer 204 can let W(t)denote cumulative distribution function (CDF) of the waiting time in thesystem that includes time in the queue and the service time. This CDFgives:Pr(T≧T _(threshold))=1−W(T _(threshold)).In one embodiment, the cloud control layer 204 can selectN=min{n:W(T_(threshold))≧1−P}, the smallest non-negative integer N thatmeets the constraint.

It is noted that in an embodiment, session-based appliances load-balanceworker nodes by sessions. Each appliance has a known capacity C numberof sessions. In one embodiment, the cloud control layer 204 canprovision N so that at any time the available system capacity NC isgreater than the number of desired active sessions, N is chosen to boundthe probability of session blocking to be less than P.

Within FIG. 2, the desired number of sessions can be modeled by thecloud control layer 204 as an M/M/∞ queue wherein sessions arrive withexponential rate λ, depart at an exponential rate μ. In one embodiment,the cloud control layer 204 can assume that λ and μ vary slowly enoughfor the cloud Instantiation Layer 206 to estimate. The cloud controllayer 204 can let X_(t) be the number of session requests at time t.

Note that in an embodiment the cloud control layer 204 can suppose N isperiodically chosen with period T>T_(start) the start-up time of a newmachine. At the start of the period time 0, X₀=n. Then N can be updatedso that for the hitting time:τ(NC)=min{t>0:Xt=NC},Pr(τ(NC)<T|X ₀ =n)≦P.

In an embodiment, the cloud control layer 204 can assume in batch jobsthat the items to be processed are already collected as part of aseparate process. In addition, the worker nodes can process their itemsin parallel and present a summarized result for aggregation.

In one embodiment of FIG. 2, the cloud control layer 204 can let T bethe time to process all items. Furthermore, the number of worker nodes Ncan be evaluated to meet some bound T on the time to complete. Forexample in an embodiment, the cloud control layer 204 can re-evaluate Ncontinuously to guarantee that the probability the batch is completedwithin the allotted time.

The cloud control layer 204 can assume in an embodiment the work isevenly partitioned so that with N worker nodes and a total workload Witems that each node processes about k=┌W/N┐ jobs. It is noted that fora given worker, the time to process k events is X=X₁+ . . . +X_(k) forthe cloud control layer 204 where the sequence X_(i) is the time toprocess items i is independent and identically (or similarly)distributed with mean d and variance σ². Then for one worker theprobability that it finishes in T is approximately:

$\rho = {1 - {Q\left( \frac{T - {kd}}{\sigma\left. \sqrt{}k \right.} \right)}}$where Q(.) is the cumulative distribution function (CDF) of the standardnormal (Central Limit Theorem). Since each worker node has independentoperations, the cloud control layer 204 can choose N={min k≧0: ρ^(k)≧P}so that the probability the batch job completes on time is large.

In one embodiment, given the primary nodes N, the number and sourcing ofredundant nodes can be determined by the cloud control layer 204 thatare needed to meet availability guarantees. Note that in an embodimentthe cloud control layer 204 can be given the appliance and provideravailability. As such, the cloud control layer 204 can let q_(i) denotethe availability of an appliance of type i. Furthermore, q_(i) can beindependent across providers. The cloud control layer 204 can let P_(j)denote the availability of provider j. Note that P_(j) can account forthe availability of the network and of the provider so that applianceslocated in the same data center region are dependent and subject to thesame availability (e.g., EC2 regions). It is noted that resources acrossregions can be independent. It is pointed out that like performancedescribed above, the availability of a vendor package platform can beconsidered on its own by the cloud control layer 204. Or the vendorpackage platform can be considered by the cloud control layer 204 as acompanion with other vendors or from the platform derived from aself-sourced collection of virtual appliances.

Within an embodiment, two cases can be considered by the cloud controllayer 204. The first case deals with appliances serving the samefunction. Then the desired redundancy can be obtained by the cloudcontrol layer 204 by limiting the maximum number of failures from a setof indistinct components. Otherwise, the second case deals withappliances that serve distinct functions and the composite availabilityis the product of the availability of each of the components. A nestedformulation can be generated of these compositions by the cloud controllayer 204 by conditioning on the availability of the providers.

FIG. 3 is an example deployment 300 in accordance with variousembodiments of the disclosure. Specifically, application A 302 can be aworker that works or operates in parallel. Note that while all nodes maybe active to maximize performance, the cloud control layer 204 canestablish that N_(A)=2 for capacity constraints. In addition,application B 304 can be an aggregator and does not scale so there isN_(B)=1 active and the others are standby. It is pointed out that thedeployment 300 is across two providers.

More specifically, the application A 302 can be deployed with threeappliances (e.g., 310, 312, and 314) on cloud 306 and two appliances(e.g., 316 and 318) in data center 308, and the application B 304 hastwo appliances (e.g., 320 and 322) on the cloud 306 and one appliance(e.g., 324) in the data center 308. It is noted that each of appliances310, 312, 314, 316, and 318 is an appliance of type 1 that is active(which has an availability of q₁). Moreover, each of appliances 320 and324 is an appliance of type 2 that is stand-by (which has anavailability of q₂) while appliance 322 is an appliance of type 2 thatis active (which has an availability of q₂).

Within FIG. 3, the providers (e.g., 306 and 308) can be conditioned onby the cloud control layer 204 so that the number of appliances m isknown. Then the number of failures is a Binomial Distribution withparameters m and 1−q_(i). The cloud control layer 204 can let F_(m,i)(x)be the CDF for this Binomial. Within the present embodiment, eachappliance in an application can be interchangeable so the availabilityof the application A 302 can be determined by the cloud control layer204 using:F _(5,A)(3)P _(C) P _(D) +F _(3,A)(1)P _(C)(1−P _(D))+F _(2,A)(0)(1−P_(C))P _(D).Similarly the availability of the application B 304 can be determined bythe cloud control layer 204 using:F _(3,B)(2)P _(C) P _(D) +F _(2,B)(2)P _(C)(1−P _(D))+F _(1,B)(0)(1−P_(C))P _(D).Since the applications A 302 and B 304 are distinct, the availability ofthe Composite Application A and B can be determined by the cloud controllayer 204 using:F _(5,A)(3)F _(3,B)(2)P _(C) P _(D) +F _(3,A)(1)F _(2,B)(2)P _(C)(1−P_(D))F _(2,A)(0)F _(1,B)(0)(1−P _(C))P _(D).

It is noted that the automated algorithms of the cloud control layer 204can leverage cloud's scalability to create a platform that meetsuser-defined reliability objectives. Note that the separation ofconcerns allows the cloud control layer 204 to remain generic. Scalingalgorithms may be reused by the cloud control layer 204 for applicationsand platforms based on, but not limited to, specified applicationrequirements, published or estimated provider values, and broadcategorizations along with handling of events, sessions, or batch.

In an embodiment, cloud can be on-demand, elastic, and pay-per-use. Notethat in one embodiment, the cloud control layer 204 sources acombination of basic external and internal computing satisfying thiscloud definition to meet custom service level agreements (SLAs).Therefore, for one-time provisioning and for real-time use, the cloudcontrol layer 204 can leverage cloud's dynamics to address reliabilityconcerns.

FIG. 4 illustrates an example of creating custom services on top ofcommodity cloud and data center offerings in accordance with variousembodiments of the disclosure. In this manner, a company or organizationcan guarantee application service level agreements (SLAs) above cloudproviders or vendors. In one embodiment, a generic cloud control layer402 can compose less commodity cloud and data center resources. Forexample in an embodiment, the cloud control layer 402 can scale, size,and source the cloud providers or vendors, as described herein. Inaddition, the cloud control layer 402 can assign workload to the cloudproviders or vendors, as described herein. Moreover, the cloud controllayer 402 can receive monitoring and can repeat the above operations, asdescribed herein.

Specifically, FIG. 4 illustrates different embodiments wherein the cloudcontrol layer 402 can be application generic regarding, but not limitedto, a storefront portal 404 (which can be dependent on order system), anorder system 406 (wherein the constraint can remain internal), andanalytics 408 (wherein the constraint can remain internal). It ispointed out that the same cloud control layer 402 can be implemented foreach of the different applications 404, 406, and 408. In addition, thecloud control layer 402 can be platform generic.

It is noted that the present embodiment illustrates that the cloudcontrol layer 402 can be application generic since it operates in asimilar manner while handling different cloud applications. For example,the storefront portal application 404, the order system application 406,and the analytics application 408 can each involve different performancecriteria or requirements from the cloud control layer 402. For examplein one embodiment, the storefront portal application 404 can include,but is not limited to, an access availability of 99.99%, a maximumlatency of 300 milliseconds (ms), and an average latency of 150 ms,which can all be dependent on an order system. Furthermore, the ordersystem application 406 can include, but is not limited to, an accessavailability of 99.999%, a maximum latency of 300 ms, and an averagelatency of 150 ms, herein a constraint is that things remain internal.In addition, the analytics application 408 can include, but is notlimited to, a turn-around of less than 24 hours (hr) and a cost within$800, wherein a constraint is that things remain internal. Given thedifferences of the storefront portal application 404, the order systemapplication 406, and the analytics application 408, it is pointed outthat the cloud control layer 402 can operate in a similar manner whilehandling these different cloud applications.

Within FIG. 4, it is noted that within the present embodiment the cloudcontrol layer 402 can be platform generic. Specifically, the logicand/or functionality of the cloud control layer 402 remains unchangedwhile it manages as one or more different cloud platforms. For examplein one embodiment, the cloud control layer 402 can operate incombination with, but is not limited to, a data center 412, a datacenter 414, a Windows Azure platform 416, and AWS (Amazon Web Services)platform 418, wherein each can involve different type of platform.

FIG. 5 illustrates an example of how to fulfill client requirements at acloud control layer 512 using less reliable data center and externalcloud parts in accordance with various embodiments of the disclosure.For example in one embodiment, one or more desired cloud requirements502 can be input (e.g., via a user interface or a configuration file) orreceived by the cloud control layer 512. Note that the desired inputcloud requirements 502 can be defined by a user and/or a client, but isnot limited to such. The one or more desired input cloud requirements502 can include a wide variety of metrics and/or constraints. Forexample within the present embodiment, the one or more desired inputcloud requirements 502 can include, but are not limited to, one or morecompletion metrics 504, an availability metric 506, a performance metric508, and one or more constraints 510.

More specifically, within the present embodiment the one or morecompletion metrics 504 can include, but are not limited to, a turnaround time in 4 hours and a less than 1% error with 95% confidence. Inaddition, the availability metric 506 can include, but is not limitedto, a 99.999% uptime while the performance metric 508 can include, butis not limited to, user requests responded to in less than 350 ms.Furthermore, the one or more constraints 510 can include, but is notlimited to, records remain on premise and costs within $800 per day.

Within FIG. 5, the cloud control layer 512 can include one or moremechanisms or functions 514. Moreover, the cloud control layer 512 canreceive one or more cloud variables 516 and one or more demand variables518. It is pointed out that the one or more mechanisms or functions 514of the cloud control layer 512 can include a wide variety of mechanismsor functions. For example in the present embodiment, the mechanisms 514of the cloud control layer 512 include, but are not limited to,sourcing, scaling, scheduling, along with assignment and flow control(which can include job partitioning and data replication) of cloud anddata center service needed in order to meet the desired inputrequirements 502. Note that the mechanisms or functions 514 allow thecloud control layer 512 to be dynamic and responsive to changes indemand or conditions (e.g., able to respond to a longer than expectedprocessing times or shortened turn around window).

In one embodiment, it is pointed out that the mechanisms or functions514 of the cloud control layer 512 can be based on the received cloudvariables 516 and the received demand variables 518. As such, thereceived cloud variables 516 and the received demand variables 518 canbe taken into consideration by the mechanisms or functions 514 of thecloud control layer 512. The one or more cloud variables 516 caninclude, but are not limited to, machine and provider elements orvariables such as, but not limited to, failure, capacity, throughput,time to start, cost, and capability. Additionally, the one or moredemand variables 518 can include, but are not limited to, activesessions, current processing time, and job sizes.

Within FIG. 5, the cloud control layer 512 can output (e.g., to thecloud instantiation layer 106) one or more scaling and assignments 520.The one or more scaling and assignments 520 can include a wide varietyof information and/or instructions. For example within the presentembodiment, the one or more scaling and assignments 520 can include, butare not limited to, what to buy 522, when to buy 524, and what to dowith it 526.

More specifically, within the present embodiment the what to buy 522 caninclude, but are not limited to, buying five Amazon Web Servers plus twodata centers. Additionally, the when to buy 524 can include, but is notlimited to, once the threshold is at 80%, start two additional AmazonWeb Servers. Moreover, the what to do with it 526 can include, but isnot limited to, bad balance across five primary nodes and duplicate two.It is understood that the what to buy 522, the when to buy 524, and thewhat to do with it 526 can be implemented with any number and/or valuesof elements and are not limited in anyway to those shown in the presentembodiment.

FIG. 6 is a user interface 600 for configuring the control layer 604 inaccordance with various embodiments of the disclosure. It is understoodthat a user can utilize the user interface 600 to input user specifiedbespoke (or indicated or requested) service level agreements (SLAs).Specifically, within the user interface 600 of FIG. 6, the user canspecify a series of desired service level agreements for a selectedapplication. In one embodiment the user interface 600 can be implementedas a graphical user interface (GUI), but is not limited to such.

In one embodiment, the user interface 600 can be implemented to includemultiple tabs. For example within the present embodiment, the userinterface 600 can include a “Specify Traffic” tab 602, a “SpecifyDeployment” tab 604, a “View Traffic SLA Status” tab 606, a “ViewDeployment SLA Status” tab 608, and an “Update Settings” tab 610, but isnot limited such. Note that the “Specify Traffic” tab 602 is shownselected within the user interface 600. In addition, the examplecontents of the “Specify Traffic” tab 602 can include, but is notlimited to, a “TrafficSLAs” sub-tab 612 and a “Traffic-Application”sub-tab 624. It is noted that the “TrafficSLAs” sub-tab 612 is shownselected and its example contents can include, but is not limited to, a“New TrafficSLA” activation button 614, an “Update TrafficSLA”activation button 616, and a “Delete TrafficSLA” activation button 618.

Within FIG. 6, when a user selects the “Update TrafficSLA” activationbutton 616, the “Choose TrafficSLA to update” selection area 622 can bedisplayed thereby enabling the user to select the application he or shedesires to update and/or view. For example, within the presentembodiment of the “Choose TrafficSLA to update” selection area 622, theuser can select to update the service level agreements of “All” theapplications, the “Web” application, or a “DB” application. Once anapplication is selected within the “Choose TrafficSLA to update”selection area 622, one or more service level agreements can bedisplayed which can enable the user to update the one or more displayedservice level agreements. For example within the present embodiment,when the Web application is selected within the “Choose TrafficSLA toupdate” selection area 622 as indicated by the highlighting of the word“Web”, one or more service level agreement fields 620 can be displayed.Specifically, the one or more service level agreement fields 620 caninclude, but are not limited to, a label, a cost, an availability, alatency, a capacity, and/or a blocking probability associated with theselected application or applications of the “Choose TrafficSLA toupdate” selection area 622.

Note that the user interface 600 may not include all of the elementsillustrated by FIG. 6. Additionally, the user interface 600 can beimplemented to include one or more elements not illustrated by FIG. 6.It is pointed out that the user interface 600 of FIG. 6 can be utilizedor implemented in any manner similar to that described herein, but isnot limited to such.

FIG. 7 illustrates both the “Specify Traffic” tab 602 and the“Traffic-Application” sub-tab 624 selected within the user interface 600in accordance with various embodiments of the disclosure. It is pointedout that the contents of the Traffic-Application sub-tab 624 can beutilized by a user to associate specific roles with a specifiedapplication.

Within the present embodiment, the example contents of theTraffic-Application sub-tab 624 can include, but is not limited to, arole identifier drop-down menu selector 702, a “Current ApplicationList” display area 704, an “Available Application List” display area706, an “Add” activation button 708, and a “Remove” activation button710. It is pointed out that the role identifier drop-down menu selector702 can be utilized by a user to assign or associate one or moreselected roles to a particular application. For instance within thepresent embodiment, the Web application listed within the CurrentApplication List display area 704 currently has a web role as indicatedby the role identifier drop-down menu selector 702.

Within FIG. 7, it is pointed out that if the Add activation button 708is selected by a user while the LoadBalance application is highlightedas shown within the Available Application List display area 706, theLoadBalance application will be associated or assigned a web role whileits label (or name or identifier) will be added to the CurrentApplication List display area 704. Note that the Remove activationbutton 710 can be utilized by a user to remove and disassociate anapplication identifier from the Current Application List display area704. For example in an embodiment, after a user selects or highlightsthe identifier of an application listed within the Current ApplicationList display area 704, the Remove activation button 710 is displayedactive and usable while the Add activation button 708 is displayedinactive and unusable. Thereafter, the user can select the Removeactivation button 710 resulting in the highlighted applicationidentifier being removed from the Current Application List display area704. In addition, the removed application is disassociated or unassignedfrom its web role.

FIG. 8 illustrates the “Specify Deployment” tab 604 selected within theuser interface 600 in accordance with various embodiments of thedisclosure. The contents of the Specify Deployment tab 604 can beutilized by a user to view and specify the primary deployment and backupdeployment associated with one or more applications.

Within the present embodiment, the example contents of the SpecifyDeployment tab 604 can include, but is not limited to, a “Get OptimalDeployment” activation button 806, an “Update Deployment” activationbutton 808, an “Optimize All Deployment” activation button 810, a“Primary Deployment” information fields 802, a “Backup Deployment”information fields 804, and a “Choose Deployment to update” selectionarea 812. Note that the “Get Optimal Deployment” activation button 806can also be referred to as the “Get Best Possible Deployment” activationbutton 806 or the “Get Appropriate Deployment” activation button 806,but is not limited to such. Furthermore, the “Optimize All Deployment”activation button 810 can also be referred to as the “EnhanceEffectiveness of All Deployment” activation button 810 or “GetAppropriate Deployment for All” activation button 810, but is notlimited to such. It is pointed out that when the Update Deploymentactivation button 808 is selected by a user, the user interface 600displays the “Choose Deployment to update” selection area 812 along withthe Primary Deployment information fields 802 and Backup Deploymentinformation fields 804 that correspond to the highlighted applicationwithin the “Choose Deployment to update” selection area 812.

Within FIG. 8, note that the “Choose Deployment to update” selectionarea 812 can be utilized by a user to select a particular application toupdate and/or view its corresponding primary deployment and backupdeployment. For example within the present embodiment, the Webapplication label is highlighted or selected within the “ChooseDeployment to update” selection area 812 thereby resulting in thedisplay of its corresponding Primary Deployment information fields 802and Backup Deployment information fields 804. It is noted that thegrayed out provider fields shown within the Primary Deploymentinformation fields 802 and the Backup Deployment information fields 804have been filtered out by the control layer 604.

It is pointed out that if the Optimize All Deployment activation button810 is selected by a user, the user interface 600 applies the optimum(e.g., minimum cost) or best possible or appropriate deployment thatsatisfies the service level agreements. Additionally, if the Get OptimalDeployment activation button 806 is selected by a user, the userinterface 600 determines and highlights the application listed withinthe “Choose Deployment to update” selection area 812 that currently hasthe optimal (e.g., minimum cost) or best possible or appropriatedeployment.

FIG. 9 illustrates the “View Traffic SLA Status” tab 606 selected withinthe user interface 600 in accordance with various embodiments of thedisclosure. The contents of the “View Traffic SLA Status” tab 606 can beutilized by a user to view composite application service levelagreements (SLAs) that include the actual (or measured) applicationservice level agreements (based on deployed) along with the userspecified application service level applications.

Within the present embodiment, the example contents of the “View TrafficSLA Status” tab 606 can include, but is not limited to, an applicationidentifier drop-down menu selector 902, a Cost graph 904, a Capacitygraph 906, a Latency graph 908, and an Availability graph 910. It isnoted that the application identifier drop-down menu selector 902 can beutilized by a user to select which composite application service levelagreements to view. For example within the present embodiment, the “Web”application is currently selected within the application identifierdrop-down menu selector 902. As such, the Cost graph 904, Capacity graph906, Latency graph 908, and Availability graph 910 display the compositeservice level agreements for the Web application.

Specifically within FIG. 9, it is pointed out that the black bar shownwithin each of the Cost graph 904, Capacity graph 906, Latency graph908, and Availability graph 910 represents the user defined applicationservice level agreement value while the gray bar within each graphrepresents the actual or measured service level agreements based ondeployed cloud components. More specifically, the X-axis of the Costgraph 904 represents the United States Dollars (USD) per hour (hr)service level agreement while the X-axis of the Capacity graph 906represents the capacity for user sessions service level agreement.Furthermore, the X-axis of the Latency graph 908 represents the latencyin milliseconds (ms) service level agreement while the X-axis of theAvailability graph 910 represents the percent of time the application isavailable (e.g., 99.999% of the time) service level agreement.

FIG. 10 illustrates the “View Deployment SLA Status” tab 608 selectedwithin the user interface 600 in accordance with various embodiments ofthe disclosure. The contents of the “View Deployment SLA Status” tab 608can be utilized by a user to view how a specific application servicelevel agreement (SLA) can be split or divided up by role.

Within the present embodiment, the example contents of the “ViewDeployment SLA Status” tab 608 can include, but is not limited to, anapplication identifier drop-down menu selector 1002, a service levelagreement attribute drop-down menu selector 1004, a first role graph1006, and a second role graph 1008. Note that the application identifierdrop-down menu selector 1002 can be utilized by a user to select whichapplication service level agreements to view. In addition, the servicelevel agreement attribute drop-down menu selector 1004 can be utilizedby the user to select which service level agreement attribute to view ofthe selected application.

Within the present embodiment of FIG. 10, it is pointed out that the“All” application is currently selected with the application identifierdrop-down menu selector 1002. Furthermore, the service level agreementattribute drop-down menu selector 1004 is currently selected to the“Cost” attribute. Given the above user selections within the presentembodiment, the user interface 600 displays the “DB” role graph 1006 andthe “Web” role graph 1008 which illustrate that the cost of the “All”application is split between the “DB” role and the “Web” role.

FIG. 11 illustrates the “Update Settings” tab 610 selected within theuser interface 600 in accordance with various embodiments of thedisclosure. Note that the contents of the “Update Settings” tab 610 canbe utilized by a user to update and view one or more settings associatedwith one or more cloud providers, one or more applications, andresetting a simulation, but is not limited to such.

Specifically within the present embodiment, the contents of the “UpdateSettings” tab 610 includes, but is not limited to, a “Providers” sub-tab1102, an “Application” sub-tab 1112, and a “Reset Simulation” sub-tab1114. It is noted that the “Providers” sub-tab 1102 is shown selectedthereby enabling a user to update and view one or more settingsassociated with one or more cloud providers. The example contents of the“Providers” sub-tab 1102 can include, but is not limited to, a “ClearProvider” activation button 1104, an “Update Provider” activation button1106, a “Choose Provider to update” selection area 1110, and CloudProvider information fields 1108.

Within FIG. 11, it is pointed out that when the “Update Provider”activation button 1106 is selected by a user, the user interface 600 candisplay the “Choose Provider to update” selection area 1110 along withone or more Cloud Provider information fields 1108 that correspond tothe highlighted or selected provider within the “Choose Provider toupdate” selection area 1110. In one embodiment, the Cloud Providerinformation fields 1108 can include, but are not limited to, a providerlabel field, a provider cost field, a provider availability field, aprovider latency field, and a provider capacity field. It is noted thatthe metrics of a provider shown within the Cloud Provider informationfields 1108 can be obtained either via estimated values from actualmeasurements of the provider or from published (or known) values, but isnot limited to such.

FIG. 12 illustrates both the “Update Settings” tab 610 and the“Application” sub-tab 1112 selected within the user interface 600 inaccordance with various embodiments of the disclosure. It is pointed outthat the contents of the “Application” sub-tab 1112 can be utilized by auser to update and view one or more settings of a specified cloudapplication, delete an existing cloud application, and add a new cloudapplication along with its corresponding settings, but is not limited tosuch.

Within the present embodiment, the example contents of the “Application”sub-tab 1112 can include, but is not limited to, a “New Application”activation button 1202, an “Update Application” activation button 1206,and a “Delete Application” activation button 1208. Note that when a userselects the “Update Application” activation button 1206, a “ChooseApplication to update” selection area 1210 can be displayed therebyenabling the user to select the application settings he or she desiresto update and/or view. For example, within the present embodiment of the“Choose Application to update” selection area 1210, the user can toupdate the settings of the “Web” application, the “DB” application, a“Batch” application, or a “LoadBalance” application. Once an applicationis selected within the “Choose Application to update” selection area1210, one or more application provider capacity fields 1204 can bedisplayed thereby enabling the user to view and/or update one or more ofthem. R is noted that the capacity of a provider may be updatedutilizing either estimated values from actual measurements of theprovider or from published (or known) values, but is not limited tosuch.

Specifically within the present embodiment of FIG. 12, when the “Web”application is selected within the “Choose Application to update”selection area 1210 as indicated by the highlighting of the word “Web”,one or more provider capacity fields 1204 corresponding to the “Web”application can be displayed by the user interface 600. For example inone embodiment, the “Web” application has a capacity of 500 sessions forthe “amazon_ec2_east” provider, a capacity of 500 sessions for the“amazon_ec2_west” provider, a capacity of 200 sessions for the“data_center_(—)1”provider, and a capacity of 200 sessions for the“data_center_(—)2” provider. It is pointed out that the grayed outprovider fields shown within the provider capacity fields 1204 have beenfiltered out by the control layer 104.

Note that the user interface 600 may not include all of the elementsillustrated by FIGS. 6-12. Moreover, the user interface 600 can beimplemented to include one or more elements not illustrated by FIGS.6-12. It is pointed out that the user interface 600 of FIGS. 6-12 can beutilized or implemented in any manner similar to that described herein,but is not limited to such.

FIG. 13 is a block diagram of an exemplary computing system 1300 thatmay be used in accordance with various embodiments of the disclosure. Itis understood that system 1300 is not strictly limited to be a computingsystem. As such, system 1300 is well suited to be any type of computingdevice (e.g., server, desktop computer, laptop computer, portablecomputing device, database computer system, handheld computing device,etc.) in accordance with various embodiments of the disclosure. In itsvarious implementations, system 1300 may not include all of the elementsillustrated by FIG. 13, or system 1300 may include other elements notshown by FIG. 13. Within discussions of various embodiments inaccordance with the disclosure herein, certain processes and operationsare discussed that can be realized, in some embodiments, as a series ofinstructions (e.g., software program, modules, and the like) that residewithin computer readable memory or storage of computing system 1300 andexecuted by a processor(s) of system 1300. When executed, theinstructions can cause computer 1300 to perform specific operations andexhibit specific behavior which are described herein, but are notlimited to such.

Computer system 1300 can include an address/data bus 1310 forcommunicating information, one or more central processors 1302 coupledwith bus 1310 for processing information and instructions. Centralprocessor unit(s) 1302 may be a microprocessor or any other type ofprocessor. The computer 1300 can also include data storage features suchas computer usable volatile memory 1304, e.g., random access memory(RAM), static RAM, dynamic RAM, etc., coupled with bus 1310 for storinginformation and instructions for central processor(s) 1302, computerusable non-volatile memory 1306, e.g., read only memory (ROM),programmable ROM, flash memory, erasable programmable read only memory(EPROM), electrically erasable programmable read only memory (EEPROM),etc., coupled with bus 1310 for storing static information andinstructions for processors) 1302.

System 1300 of FIG. 13 can also include one or more signal generatingand receiving devices 1308 coupled with bus 1310 for enabling system1300 to interface with other electronic devices. The communicationinterface(s) 1308 of the present embodiment may include wired and/orwireless communication technologies. For example in one embodiment, thecommunication interface 1308 is a serial communication port, but couldalso alternatively be any of a number of well known communicationstandards and protocols, e.g., a Universal Serial Bus (USB), an Ethernetadapter, a FireWire (IEEE 1394) interface, a parallel port, a smallcomputer system interface (SCSI) bus interface, an infrared (IR)communication port, a Bluetooth wireless communication adapter, abroadband connection, and the like. In an embodiment, a cable or digitalsubscriber line (DSL) connection may be employed. In such a case thecommunication interface(s) 1308 may include a cable modem or a DSLmodem.

Optionally, computer system 1300 can include an alphanumeric inputdevice 1314 including alphanumeric and function keys coupled to the bus1310 for communicating information and command selections to the centralprocessor(s) 1302. The computer 1300 can also include an optional cursorcontrol or cursor directing device 1316 coupled to the bus 1310 forcommunicating user input information and command selections to theprocessor(s) 1302. The cursor directing device 1316 can be implementedusing a number of well known devices such as, but not limited to, amouse, a track ball, a track pad, an optical tracking device, a touchscreen, etc. Alternatively, it is appreciated that a cursor can bedirected and/or activated via input from the alphanumeric input device1314 using special keys and key sequence commands. The presentembodiment is also well suited to directing a cursor by other means suchas, for example, voice commands.

Within FIG. 13, the system 1300 can also include an optional computersable mass data storage device 1318 such as a magnetic or optical diskand disk drive (e.g., hard drive or floppy diskette) coupled with bus1310 for storing information and instructions. The system 1300 caninclude an optional display device 1312 coupled to bus 1310 fordisplaying video and/or graphics. Note that the optional display device1312 may be implemented with different technologies. For example, theoptional display device 1312 may be implemented with, but is not limitedto, a cathode ray tube (CRT), flat panel liquid crystal display (LCD),field emission display (FED), plasma display, light emitting diode (LED)display, or any other display device suitable for displaying videoand/or graphic images and alphanumeric characters recognizable to auser.

Note that the volatile memory 1304 may store a cloud management systemor module 1320 in accordance with various embodiments of the disclosure.The cloud management system or module 1320 may include instructions tocause the system 1300 to operate or function in any manner similar tothat described herein, but not limited to such. It is pointed out thatin various embodiments, the cloud management system or module 1320 (orone or more of its components) may be stored by the volatile memory1304, or the non-volatile memory 1306, or the mass data storage device1318, or any combination thereof. In various embodiments, the cloudmanagement system or module 1320 can be an implementation of the cloudmanagement system or module 100 of FIG. 1 or the cloud management systemor module 200 of FIG. 2, but is not limited to such.

Within FIG. 13, it is noted that the components associated with system1300 described above may be resident to and associated with one physicalcomputing device. However, one or more of the components associated withsystem 1300 may be physically distributed to other locations and becommunicatively coupled together (e.g., via one or more networks).

Note that the system 1300 may not include all of the elementsillustrated by FIG. 13. Furthermore, the system 1300 can be implementedto include one or more elements not illustrated by FIG. 13. It ispointed out that the system 1300 can be utilized or implemented in anymanner similar to that described herein, but is not limited to such.

The foregoing descriptions of various specific embodiments in accordancewith the disclosure have been presented for purposes of illustration anddescription. They are not intended to be exhaustive or to limit thedisclosure to the precise forms disclosed, and many modifications andvariations are possible in light of the above teaching.

What is claimed is:
 1. A computer-implemented method comprising:obtaining, by a cloud control module of a cloud management system,constraint information associated with a software application;determining, by the cloud control module of the cloud management systemand based on the constraint information associated with the softwareapplication, that a capacity of a cloud architecture is to be scaled bydeploying instances of the software application across multiple cloudcomputing platforms that each are sourced by a different cloud computingplatform provider; receiving, by the cloud control module of the cloudmanagement system, capability information associated with a set ofavailable cloud computing platforms; and selecting, by the cloud controlmodule of the cloud management system, a subset of at least two cloudcomputing platforms that are sourced by at least two different cloudcomputing platform providers from the available cloud computingplatforms to provide (i) code for instances of the software applicationon the cloud computing platforms of the subset and (ii) configurationinformation for configuring the cloud computing platforms of the subsetto execute the code, the selecting based at least on the constraintinformation associated with the software application and the capabilityinformation associated with the set of available cloud computingplatforms.
 2. The method of claim 1, wherein determining, by the cloudcontrol module of the cloud management system and based on theconstraint information associated with the software application, that acapacity of a cloud architecture is to be scaled by deploying instancesof the software application across multiple cloud computing platformsthat each are sourced by a different cloud computing platform provider,comprises: determining that the cloud architecture is not satisfying athreshold indicated by the constraint information; in response todetermining that the cloud architecture is not satisfying the thresholdindicated by the constraint information, determining a bottleneck in thecloud architecture is causing the cloud architecture to not satisfy thethreshold; and in response to determining the bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold, determining that scaling the capacity of the cloudarchitecture reduces an effect of the bottleneck on the cloudarchitecture not satisfying the threshold.
 3. The method of claim 2,wherein determining that the cloud architecture is not satisfying athreshold indicated by the constraint information, comprises: monitoringa performance statistic for the cloud architecture; and determining thatthe performance statistic is not satisfying a corresponding thresholdindicated by the constraint information for the performance statistic.4. The method of claim 2, wherein determining a bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold, comprises: identifying an instance of the softwareapplication that is causing the cloud architecture to not satisfy thethreshold.
 5. The method of claim 1, wherein selecting, by the cloudcontrol module of the cloud management system, a subset of at least twocloud computing platforms that are sourced by at least two differentcloud computing platform providers, comprises: determining thatdeploying the code for the instances of the software application on thesubset of the available cloud computing platforms will satisfy theconstraint information associated with the software application.
 6. Themethod of claim 5, wherein determining that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application, comprises: determining thatthe capability information associated with the subset of the availablecloud computing platforms indicates that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application.
 7. The method of claim 1,comprising: providing a cloud transformation module of the cloudmanagement system an indication of the at least two cloud computingplatforms of the subset, the cloud transformation module configured toobtain, for each of the cloud computing platforms of the subset, thecode for the instance of the software application on the cloud computingplatform, where the code is specific to the cloud computing platform. 8.A cloud management system comprising: one or more computers; and one ormore storage devices storing instructions that are operable, whenexecuted by the one or more computers, to cause the one or morecomputers to perform operations comprising: obtaining, by a cloudcontrol module of the cloud management system, constraint informationassociated with a software application; determining, by the cloudcontrol module of the cloud management system and based on theconstraint information associated with the software application, that acapacity of a cloud architecture is to be scaled by deploying instancesof the software application across multiple cloud computing platformsthat each are sourced by a different cloud computing platform provider;receiving, by the cloud control module of the cloud management system,capability information associated with a set of available cloudcomputing platforms; and selecting, by the cloud control module of thecloud management system, a subset of at least two cloud computingplatforms that are sourced by at least two different cloud computingplatform providers from the available cloud computing platforms toprovide (i) code for instances of the software application on the cloudcomputing platforms of the subset and (ii) configuration information forconfiguring the cloud computing platforms of the subset to execute thecode, the selecting based at least on the constraint informationassociated with the software application and the capability informationassociated with the set of available cloud computing platforms.
 9. Thesystem of claim 8, wherein determining, by the cloud control module ofthe cloud management system and based on the constraint informationassociated with the software application, that a capacity of a cloudarchitecture is to be scaled by deploying instances of the softwareapplication across multiple cloud computing platforms that each aresourced by a different cloud computing platform provider, comprises:determining that the cloud architecture is not satisfying a thresholdindicated by the constraint information; in response to determining thatthe cloud architecture is not satisfying the threshold indicated by theconstraint information, determining a bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold; and in response to determining the bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold, determining that scaling the capacity of the cloudarchitecture reduces an effect of the bottleneck on the cloudarchitecture not satisfying the threshold.
 10. The system of claim 9,wherein determining that the cloud architecture is not satisfying athreshold indicated by the constraint information, comprises: monitoringa performance statistic for the cloud architecture; and determining thatthe performance statistic is not satisfying a corresponding thresholdindicated by the constraint information for the performance statistic.11. The system of claim 9, wherein determining a bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold, comprises: identifying an instance of the softwareapplication that is causing the cloud architecture to not satisfy thethreshold.
 12. The system of claim 8, wherein selecting, by the cloudcontrol module of the cloud management system, a subset of at least twocloud computing platforms that are sourced by at least two differentcloud computing platform providers, comprises: determining thatdeploying the code for the instances of the software application on thesubset of the available cloud computing platforms will satisfy theconstraint information associated with the software application.
 13. Thesystem of claim 12, wherein determining that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application, comprises: determining thatthe capability information associated with the subset of the availablecloud computing platforms indicates that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application.
 14. The system of claim 8,comprising: providing a cloud transformation module of the cloudmanagement system an indication of the at least two cloud computingplatforms of the subset, the cloud transformation module configured toobtain, for each of the cloud computing platforms of the subset, thecode for the instance of the software application on the cloud computingplatform, where the code is specific to the cloud computing platform.15. A non-transitory computer-readable medium storing softwarecomprising instructions executable by one or more computers which, uponsuch execution, cause the one or more computers to perform operationscomprising: obtaining, by a cloud control module of a cloud managementsystem, constraint information associated with a software application;determining, by the cloud control module of the cloud management systemand based on the constraint information associated with the softwareapplication, that a capacity of a cloud architecture is to be scaled bydeploying instances of the software application across multiple cloudcomputing platforms that each are sourced by a different cloud computingplatform provider; receiving, by the cloud control module of the cloudmanagement system, capability information associated with a set ofavailable cloud computing platforms; and selecting, by the cloud controlmodule of the cloud management system, a subset of at least two cloudcomputing platforms that are sourced by at least two different cloudcomputing platform providers from the available cloud computingplatforms to provide (i) code for instances of the software applicationon the cloud computing platforms of the subset and (ii) configurationinformation for configuring the cloud computing platforms of the subsetto execute the code, the selecting based at least on the constraintinformation associated with the software application and the capabilityinformation associated with the set of available cloud computingplatforms.
 16. The medium of claim 15, wherein determining, by the cloudcontrol module of the cloud management system and based on theconstraint information associated with the software application, that acapacity of a cloud architecture is to be scaled by deploying instancesof the software application across multiple cloud computing platformsthat each are sourced by a different cloud computing platform provider,comprises: determining that the cloud architecture is not satisfying athreshold indicated by the constraint information; in response todetermining that the cloud architecture is not satisfying the thresholdindicated by the constraint information, determining a bottleneck in thecloud architecture is causing the cloud architecture to not satisfy thethreshold; and in response to determining the bottleneck in the cloudarchitecture is causing the cloud architecture to not satisfy thethreshold, determining that scaling the capacity of the cloudarchitecture reduces an effect of the bottleneck on the cloudarchitecture not satisfying the threshold.
 17. The medium of claim 16,wherein determining that the cloud architecture is not satisfying athreshold indicated by the constraint information, comprises: monitoringa performance statistic for the cloud architecture; and determining thatthe performance statistic is not satisfying a corresponding thresholdindicated by the constraint information for the performance statistic.18. The medium of claim 16, wherein determining a bottleneck in thecloud architecture is causing the cloud architecture to not satisfy thethreshold, comprises: identifying an instance of the softwareapplication that is causing the cloud architecture to not satisfy thethreshold.
 19. The medium of claim 15, wherein selecting, by the cloudcontrol module of the cloud management system, a subset of at least twocloud computing platforms that are sourced by at least two differentcloud computing platform providers, comprises: determining thatdeploying the code for the instances of the software application on thesubset of the available cloud computing platforms will satisfy theconstraint information associated with the software application.
 20. Themedium of claim 19, wherein determining that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application, comprises: determining thatthe capability information associated with the subset of the availablecloud computing platforms indicates that deploying the code for theinstances of the software application on the subset of the availablecloud computing platforms will satisfy the constraint informationassociated with the software application.